A comparison of session variability compensation techniques for SVM-based speaker recognition
نویسندگان
چکیده
This paper compares two of the leading techniques for session variability compensation in the context of GMM mean supervector SVM classifiers for speaker recognition: inter-session variability modelling and nuisance attribute projection. The former is incorporated in the GMM model training while the latter is employed as a modified SVM kernel. Results on both the NIST 2005 and 2006 corpora demonstrate the effectiveness of both techniques for reducing the effects of session variation. Further, systemand score-level fusion experiments show that the combination of the two methods provides improved performance.
منابع مشابه
Session variability compensation in speaker and language recognition
This report summarises the research work performed by the author in order to start his Ph.D Thesis which is based on robust automatic speaker and language recognition. One of the main causes of errors in automatic speaker and language recognition systems is due to intrinsic variability between sessions of a same speaker. This variability known as session or channel variability is caused by seve...
متن کاملIntra-session Variability Compensation for Speaker Segmentation
This paper addresses the problem of speaker segmentation in two speaker telephone conversations, proposing a segmentation approach based on factor analysis and a novel method for intra-session variability compensation to improve segmentation performance. The segmentation system is evaluated on the NIST Speaker Recognition Evaluation 2008 summed channel test condition, showing that intra-session...
متن کاملDiscriminant NAP for SVM speaker recognition
Nuisance Attribute Projection (NAP) provides an effective method of removing the unwanted session variability in a Support Vector Machine (SVM) based speaker recognition system by removing the principal components of this variability. There is no guarantee with the methods proposed, however, that desired speaker variability is retained. This paper investigates the possibility of training NAP di...
متن کاملSpeaker recognition experiments using connectionist transformation network features
The use of adaptation transforms common in speech recognition systems as features for speaker recognition is an appealing alternative approach to conventional short-term cepstral modelling of speaker characteristics. Recently, we have shown that it is possible to use transformation weights derived from adaptation techniques applied to the Multi Layer Perceptrons that form a connectionist speech...
متن کاملIntersession Compensation and Scoring Methods in the i-vectors Space for Speaker Recognition
The total variability factor space in speaker verification system architecture based on Factor Analysis (FA) has greatly improved speaker recognition performances. Carrying out channel compensation in a low dimensional total factor space, rather than in the GMM supervector space, allows for the application of new techniques. We propose here new intersession compensation and scoring methods. Fur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007